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Abstract 

A random intersection graph is constructed by assigning independently to each 
vertex a subset of a given set and drawing an edge between two vertices if and 
only if their respective subsets intersect. In this paper a model is developed in 
which each vertex is given a random weight, and vertices with larger weights 
are more likely to be assigned large subsets. The distribution of the degree of a 
given vertex is characterized and is shown to depend on the weight of the vertex. 
In particular, if the weight distribution is a power law, the degree distribution 
will be so as well. Furthermore, an asymptotic expression for the clustering in 
the graph is derived. By tuning the parameters of the model, it is possible to 
generate a graph with arbitrary clustering, expected degree and - in the power 
law case - tail exponent. 
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bution, clustering, social networks. 

AMS 2000 Subject Classification: 05C80, 9ID30. 


1 Introduction 

During the last decade there has been a large interest in the study of large complex 
networks; see e.g. Dorogovtsev and Mendes (2003) and Newman et al. (2006) and 
the references therein. Due to the rapid increase in computer power, it has become 
possible to investigate various types of real networks such as social contact struc¬ 
tures, telephone networks, power grids, the Internet and the World Wide Web. The 
empirical observations reveal that many of these networks have similar properties. 
For instance, they typically have power law degree sequences, that is, the fraction of 
vertices with degree k is proportional to k~'^ for some exponent r > 1. Furthermore, 
many networks are highly clustered, meaning roughly that there is a large number 
of triangles and other short cycles. In a social network, this is explained by the 
fact that two people who have a common friend often meet and become friends, 
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creating a triangle in the network. A related explanation is that human populations 
are typically divided into various subgroups - working places, schools, associations 
etc - which gives rise to high clustering in the social network, since members of a 
given group typically know each other; see Palla et ah (2005) for some empirical 
observations. 

Real-life networks are generally very large, implying that it is a time-consuming 
task to collect data to delineate their structure in detail. This makes it desirable 
to develop models that capture essential features of the real networks. A natural 
candidate to model a network is a random graph, and, to ht with the empirical 
observations, such a graph should have a heavy-tailed degree distribution and con¬ 
siderable clustering. We will quantify the clustering in a random graph by the 
conditional probability that three given vertices constitute a triangle, given that 
two of the three possible links between them exist. Other (empirical) definitions 
occur in the literature - see e.g. Newman (2003) - but they all capture essentially 
the same thing. 

Obviously, the classical Erdos-Renyi graph will not do a good job as a network 
model, since the degrees are asymptotically Poisson distributed. Moreover, existing 
models for generating graphs with a given degree distribution - see e.g. Molloy and 
Reed (1995, 1998) - typically have zero clustering in the limit. In this paper, we 
propose a model, based on the so-called random intersection graph, where both the 
degree distribution and the clustering can be controlled. More precisely, the model 
makes it possible to obtain arbitrary prescribed values for the clustering and to 
control the mean and the tail behavior of the degree distribution. 

1.1 Description of the model 

The random intersection graph was introduced in Singer (1995) and Karohski et al. 
(1999), and has been further studied and generalized in Fill et al. (2000), Godehardt 
and Jaworski (2002), Stark (2004) and Jaworksi et al. (2006). Newman (2003) and 
Newman and Park (2003) discuss a similar model. In its simplest form the model is 
defined as follows. 

1. Let V = {1,... , n} be a set of n vertices and A a set of m elements. For 
p G [0,1], construct a bipartite graph B{n,m,p) with vertex sets V and A 
by including each one of the nm possible edges between vertices from V and 
elements from A independently with probability p. 

2. The random intersection graph G{n,m,p) with vertex set V is obtained by 
connecting two distinct vertices i,j G V if and only if there is an element 
a G M such that both i and j are adjacent to a in B{n, m,p). 

When the vertices in V are thought of as individuals and the elements of A as social 
groups, this gives rise to a model for a social network in which two individuals are 
joined by an edge if they share at least one group. In the following, we frequently 
borrow the terminology from the field of social networks and refer to the vertices as 
individuals and the elements of A as groups, with the understanding that the model 
is of course much more general. 
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To get an interesting structure, the number of groups m is typically set to m = 
[n“J for some a > 0; see Karohski et al. (1999). We will assume this form for m in 
the following. Let Di be the degree of vertex i G V in G{n,m,p). The probability 
that two individuals do not share a group in B{n,m,p) is {1 — It follows that 

the edge probability in G{n,m,p) is 1 — (1 — and hence the expected degree is 

E[A] = in-l)il-il-p^r) 

= (n — 1) [mp^ + 0{rin?‘p^)) . 

To keep the expected degree bounded as n —>■ oo, we let p = for some 

constant 7 > 0. We then have that E[Zli] —)■ 7 ^. 

Stark (2004; Theorem 2) shows that in a random intersection graph with the 
above choice ofp, the distribution of the degree of a given vertex converges to a point 
mass at 0, a compound Poisson distribution or a Poisson distribution depending on 
whether a<l,a = lora>l. This means that the current model cannot account 
for the power law degree distributions typically observed in real networks. 

In the above formulation of the model, the number of groups that a given individ¬ 
ual belongs to is binomially distributed with parameters m and p. A generalization 
of the model, allowing for an arbitrary group distribution, is described in Godehardt 
and Jaworski (2002). The degree of a given vertex in such a graph is analyzed in 
Jaworski et al. (2006), where conditions on the group distribution are specified under 
which the degree is asymptotically Poisson distributed. 

In the current paper, we are interested in obtaining graphs where non-Poissonian 
degree distributions can be identified. To this end, we propose a generalization of 
the original random intersection graph where the edge probability p is random and 
depends on weights associated with the vertices. Other work in this spirit include for 
instance Chung and Lu (2002:1,2), Yao et al. (2005), Britton et al. (2006), Bollobas 
et al. (2007) and Deijfen et al. (2007). The model is defined as follows: 

1. Let n be a positive integer, and define m = [,5n“J with a,/3 > 0. As before, 
take V = { 1 ,..., n} to be a set of n vertices and A a set of m elements. Also, 
let {Wi} be an i.i.d. sequence of positive random variables with distribution 
F, where F is assumed to have mean 1 if the mean is finite. Finally, for some 
constant 7 > 0 , set 

Pi = 7lTin-(^+“)/2 (1) 

Now construct a bipartite graph B{n,m,F) with vertex sets V and A by 
adding edges to the elements of A for each vertex i £ V independently with 
probability pi. 

2. The random intersection graph G{n, m, F) is obtained as before by drawing an 
edge between two distinct vertices i,j G V if and only if they have a common 
adjacent vertex a£ Ain B{n,m, F). 

In the social network setting, the weights can be interpreted as a measure of 
the social activity of the individuals. Indeed, vertices with large weights are more 
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likely to join many groups and thereby acquire many social contacts. There are 
several other examples of real networks where the success of a vertex (measured 
by its degree) depends on some specific feature of the vertex; see e.g. Palla et al. 
(2005) for an example in the context of protein interaction networks. Furthermore, 
an advantage of the model is that it has an explicit and straightforward construction 
which, as we will see, makes it possible to exactly characterize the degree distribution 
and the clustering in the resulting graph. 

1.2 Results 

Our results concern the degree distribution and the clustering in the graph G(n, m, F) 
as n —>■ oo. More precisely, we will take the parameters a, /3, 7 and the weight dis¬ 
tribution F to be fixed (independent of n) and then analyze the degree of a given 
vertex and the clustering in the graph as n — 00 . It turns out that the behavior 
of these quantities will be different in the three regimes a < 1 , a = 1 and a > 1 
respectively. The interesting case is a = 1, in the sense that this is when both the 
degree distribution and the clustering can be controlled. The cases a < 1 and a > 1 
are included for completeness. 

As for the degree, we begin by observing that, if F has finite mean, then the 
asymptotic mean degree of vertex i, conditional on Wi, is given by jS'y'^Wi for all 
values of a. 

Proposition 1.1 Let Di he the degree of vertex i a V in a random intersection 
graph G(n, m, F) with m = [/3n“J and pi as in If F has finite mean, then, for 
all values of a > 0, we have that E[ilj|ITj] —>■ as n ^ 00 . 

Proof. We prove the claim for vertex i = 1. Define 

Wj = Wj ■ and W” = Wj ■ l{i 4 /.>„i/ 4 } 

and let D' and D" denote the degree of vertex 1 when {Wj}jj^i are replaced by 
{IFj} and {W-} respectively, that is, D' is the number of neighbors of 1 with weight 
smaller than or equal to and D” is the number of neighbors with weight larger 
than Write p' and p'- for the analog of ([I]) based on the truncated weights. 

Now, conditional on the weights, the probability that there is an edge between 
1 and j is 1 — (1 — piPj)^- To see that E[T)"] —)■ 0 as n —>■ cx), we observe that 

1 — (1 — Pip'jY^ < mpiPj = 

Summing the expectation of the right-hand side over j ^ 1, keeping Wi fixed, gives 
(recall the truncation at 1 in p!])) 

B[D"] < ,57n(^+")/2E[p"] < , 

where both terms on the right hand side converge to 0 as n —>■ 00 since F has finite 
mean. As for D', we have 

1 - {1-pip'jr = + OiWfiW^fn-"^). 
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The sum over j 7 ^ 1 of the expectation of the first term equals / 37 ^ 1 TiE[VF^], where 
E[lTfc] = 1 (since F has finite mean) and the sum of the expectation of the 
second term converges to 0 (since (VFj)^ — Since Dq = D' + D”, this proves 

the proposition. □ 

The following theorem, which is a generalization of Theorem 2 in Stark (2004), 
gives a full characterization of the degree distribution for different values of a. 

Theorem 1.1 Consider the degree Di of vertex i & V in a random intersection 
graph G{n,m,F) with m = [/3n“J and pi as in ([I]), and assume that F has finite 
mean. 

(a) If a <1, then Di converges in distribution to a point mass at 0 as n ^ 00 . 

(b) If a = 1, then Di converges in distribution to a sum of a Poisson(/IjWi) dis¬ 
tributed number of Poisson('y) variables, where all variables are independent. 

(c) If a> 1, then Di is asymptotically Poisson(f3'y‘^Wi) distributed. 

To understand Theorem ll.il note that the expected number of groups that indi¬ 
vidual i belongs to is roughly . If a < 1 and Wi has finite mean, this 

converges to 0 in probability, so that the degree distribution converges to a point 
mass at 0, as stated in (a) (the group size however goes to infinity, explaining why 
the expected degree is still positive in the limit). For a = 1, the number of groups 
that individual i is a member of is Poisson(/ 37 lTi) distributed as n —>■ 00 , and the 
number of other individuals in each of these groups is approximately Poisson( 7 ) 
distributed, which explains (b). Finally, for a > 1, individual i belongs to infinitely 
many groups as n ^ 00 . This means that the edges indicators will be asymptotically 
independent, giving rise to the Poisson distribution specified in (c). 

Moving on to the clustering, write Eij for the event that individuals i,j G V have 
a common group in the bipartite graph F) - that is, Eij is equivalent to 

the event that there is an edge between vertices i and j in G{n, m, F) - and let Pn, 
be the probability measure of B{n, m, F) conditional on the weights {Wi, ..., Wn\. 
For distinct vertices i,j,k € V, define 

^i,j,k ~ (EijlEik, Ejj .), ( 2 ) 

that is, is the edge probability between i and j in G(n, m, F) given that they 
are both connected to k, conditional on the weights. To quantify the asymptotic 
clustering in the graph we will use 



where the expectation is taken over the weights, that is, c{G) is the limiting prob¬ 
ability that three given vertices constitute a triangle conditional on that two of the 
three possible edges between them exist (the vertices are indistinguishable, so indeed 
c{G) does not depend on the particular choice of i,j and k). This should be closely 
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related to the limiting quotient of the number of triangles and the number of triples 
with at least two edges present, which is one of the empirical measures of clustering 
that occur in the literature; see e.g. Newman (2003). Establishing this connection 
rigorously however requires additional arguments. 

-(n) 

The asymptotic behavior of c' A is specified in the following theorem. By 
bounded convergence it follows that c{G) is obtained as the mean of the in-probability- 
limits. 

Theorem 1.2 Let {i,j,k} be three distinct vertices in a random intersection graph 
G{n,m, F) with m = [;dn“J and pi as in {7]). If F has finite mean, then 

(a) 1 in probability for a < 1 ; 

(b) t (1 -t- l3'yWk)~^ in probability for a = 1 ; 

(c) —>■ 0 in probability for a > 1 . 

To understand Theorem ll.21 assume that i and k share a group and that j and k 
share a group. The probability that i and j also have a common group then depends 
on the number of groups that the common neighbor k belongs to. Indeed, the fewer 
groups k belongs to, the more likely it is that i and j in fact share the same group 
with k. Recall that the expected number of groups that k belongs to is roughly 
■ If a > 1, this goes to 0 as n —?• oo. Since it is then very unlikely that 
k belongs to more than one group when n is large, two given edges {i, k} and {j, k} 
are most likely generated by the same group, meaning that i and j are connected 
as well. On the other hand, if a > 1, the number of groups that k belongs to is 
asymptotically infinite. Hence, that i and j each belong to one of these groups, does 
not automatically make it likely that they actually belong to the same group. If 
a = 1 , individual k belongs to fi'yWk groups on average, explaining the expression 
in part (b) of the theorem. 

From Theorem 11.21 it follows that, to get a nontrivial tunable clustering, we 
should choose a = 1. Indeed, then we have c{G) = E[(l -|- and, for a 

given weight distribution F (with finite mean), c(G) can be varied between 0 and 
1 by adjusting the parameters fi and 7 . Furthermore, when a = 1, the degree 
distribution for a given vertex is asymptotically compound Poisson with the weight 
of the vertex as one of the parameters - see Theorem ll.il fb! - and it is not hard to 
see that, if F is a power law with exponent r, then the degree distribution will be 
so as well. Since the mean of F is set to 1, the expected asymptotic degree is 
by Proposition 11.11 Taken together, this means that, when a = 1, we can obtain 
a graph with a given value of the clustering and a power law degree distribution 
with prescribed exponent and prescribed mean by first choosing F to be a power 
law with the desired exponent and then tuning the parameters fi and 7 to get the 
correct values of the clustering and the expected degree. 

The rest of the paper is organized as follows. In Sections [2] and [3l Theorem o 
and Theorem O are proved, respectively. The clustering is analyzed for the im- 
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portant example of a power law weight distribution in Section 01 Finally, Section [5] 
provides an outline of possible future work. 


2 The degree distribution 


We begin by proving Theorem ll.il 

Proof of Theorem II.IL We prove the theorem for vertex i = 1. Write Di = D, 
and denote by N the number of groups that individual 1 belongs to. Conditional on 
Wi, the variable N is binomially distributed with parameters m and pi and thus 

p„(iV = 0) = (1 -Pi)™ > 1 - mpi > 1 - 


For a < 1, the expectation of the last term converges to 0 as n ^ oo, and it follows 
from bounded convergence that P(A^ = 0) = E[P„(A/' = 0)] —)• 1. This proves (a), 
since clearly D = 0 if individual 1 is not a member of any group. 

To prove (b) and (c), first recall the definition of the weights {IF/} and (IF/'} - 
truncated from above and below respectively at vi}l^ - and the corresponding degree 
variables D' and D" from the proof of Proposition 11.11 We have already showed 
(in proving Proposition in]) that E[I1"] —>■ 0, which implies that D" converges to 
0 in probability (indeed P[D'' > 0) < E[I1"]). Hence it suffices to show that the 
generating function of D' converges to the generating function of the claimed limiting 
distribution. To this end, we condition on the weight IFi, which is thus assumed to 
be fixed in what follows, and let X[ (i = 2,..., n) denote the number of common 
groups of individual 1 and individual i when the truncated weights IF/ are used 
for i 7 ^ 1. Since two individuals are connected if and only if they have at least 
one group in common, we can write D' = Yl ?=2 Furthermore, conditional 

on N and {lF/}i> 2 , the random variables X'-, i = 2,...,n, are independent and 
binomially distributed with parameters N and Hence, with P„ 

denoting the probability measure of the bipartite graph B(n,m, F) conditional on 
both {lF/}i >2 and N, the generating function of D' can be written as 


= E 




li=2 


= E 


U=2 


n(l + (i-l)Pn(^'>l 


where t G [0,1]. Using the Taylor expansion log(l + x) = x + O(x^) and the fact 
that 

Pn(X' > 1) = 1 - (1-p')^ = Np[ + 0{N\p[f), 

we get that 


n 

(^1 + (t-1)P„(X'> 1)) = 


( 3 ) 


Defining 

n 

Rn := n (l + - l)Pn(W' > 1)) - 

i=2 
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we therefore have that 


^ /gO(7V2 E(p')') 



Since the product in ([2D is the conditional expectation of with t G [0,1], it takes 
values between 0 and 1 and, since G (0)1]) it follows that Rn G [—1)1]- 

Furthermore, recalling that VF/ < we have for a > 1 that 

n n 

i=2 i=2 


implying that Rn —>■ 0 in probability and thus, by bounded convergence, E[i?„] —)• 0. 
Hence we are done if we show that 

(i) E ^ g/37tVi(er(‘-i)-l) g, = i- 

(ii) E [eO-i)AfEPi] ^ e/37"H/i(i-i) if « > 1, 

where the limits are recognized as the generating functions for the desired compound 
Poisson and Poisson distribution in part (b) and (c) of the theorem, respectively. To 
this end, note that the expectation with respect to N of is given by the 

generating function for N evaluated at the point Since N is binomially 

distributed with parameters m and pi, we have that 


E 


,(i-i)iVEp' 


= E 


1 +pi _ I 


(4) 


For a = 1, we have m = [/3nJ andp' = yW/n Recalling that E[W/] —)• E[VFj] = 1, 
it follows that Ylp'i ^ 1 almost surely. Hence, 


1 +Pi 


di-i)Ep' 


- 1 


L/3nJ 


„/37Tyi(eT'(*-i)-l) 


a.s. as n 


oo. 


and it follows from bounded convergence that the expectation converges to the same 
limit, proving (i). 

For a > 1, define p' = With m = [/3n°‘\ and pi = A 1, 

we get after some rewriting, that 


1 +Pi 


.h-i)Ep' 


-1 



7kFi(t - l)Ylp'i 
n“ 


eh-i)«(i-“)/"Ep' _ A 
(t- ) 


By the law of large numbers, Ylp'i ^ 1 almost surely, and, since (e® — l)/x —)• 1 
as X —^ 0, it follows that the right hand side above converges to almost 

surely as n —>■ oo. By (HD and bounded convergence, this proves (ii). □ 








3 Clustering 


In this section, we prove Theorem 11.21 First recall that Eij denotes the event that 
the individuals i,j E V share at least one group. It will be convenient to extend this 
notation. To this end, for i,j, A; E V, denote by Eiji^ the event that there is at least 
one group to which all three individuals i, j and k belong, and write Eij^ikjk for the 
event that there are at least three distinct groups to which i and j, i and k, and j 
and k respectively belong. Similarly, the event that there are two distinct groups 
to which individuals i and k, and j and k respectively belong is denoted by Ei^jk- 
The proof of Theorem 11.21 relies on the following lemma. 


Lemma 3.1 Consider a random intersection graph G{n, m, E) with m = [/3n"J and 
Pi defined as in m- For any three distinct vertices i,j,k E V, we have that 


fai P (E -C - I n f . 

fa'l — j^(3+a)/2 ' 1 j’ 

(b) + o ( 5152 ) ; 

(c) + o : 

(d) Pn(E„„Eti,jo = o ■ 

Proof. As for (a), the probability that three given individuals i, j and k do not 
share any group at all is (1 — PiPjPk)'^- Using the definitions of m and the edge 
probabilities {pi}, it follows that 


^n{Eijk) = 1 - (1 -PiPjPk)'^ = 


fi^^WjWjWk 

^{3+a)/2 


+ 0 


'w^wfwf 


n 


3+a 


To prove (b), note that the probability that there is exactly one group to which 


both i and j belong is mpiPj{l — PiPj)^~^ = rnpipj + 0{m^p~pj). Given that i and 
j share one group, the probability that i and k share exactly one of the other m—1 
groups is (m — l)piPk{l — PiPk)^~‘^ = mpiPk + 0{mfipfp\). Finally, the conditional 
probability that there is a third group to which both j and k belong given that the 


pairs i,j and i, k share one group each is 1 — (1 — PjPk)^~^ = 'mpjPk + 0{m^pjpf.). 
Combining these estimates, and noting that scenarios in which i and j or i and k 
share more than one group have negligible probability in comparison, we get that 


Pn {Eij^ikjk) 


= rn^PiP]Pk + O {m^ph]Pk{PiPj + PiPk + PjPk)) 




+ 0 




'wfwfwl' 


Part (c) is derived analogously. 
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As for (d), note that the event EijkEikjk occurs when there is at least one group 
that is shared by all three vertices i, j and k and a second group shared by either 
i and k or j and k. Denote by r the probability that individual k and at least one 
of the individuals i and j belong to a fixed group. Then r = pk{pi + Pj — PiPj), 
and, conditional on that there is exactly one group to which all three individuals i, 
j and k belong (the probability of this is mpiPjPk{l — PiPjPk)'^~^ = 0{mpiPjPk)), 
the probability that there is at least one other group that is shared either by i and 
k or by j and A: is 1 — (1 — = 0{mr). It follows that 

Pn{EijkEikjk) = 0{rr?piPjPkr) = O 

□ 


(wfwfwl\ 

1 ^(5+a)/2 j ■ 


Using Lemma EH it is not hard to prove Theorem 11.21 
Proof of Theorem II.21 Recall the definition ([2}) of and note that 


^n(,Eij \EikEjk') 


^n{Eijk U 
^n{E^jk U Eik^jk') 


As for (a), applying the estimates of Lemma l3.II and merging the error terms yields 


^n{Eij\EikEjk) > 


'P n(,Eijk) 

T ^niEikJk') 

1 + 0(ITiIUjIUfcn-(3+“)/2) 

1 + IUfc[;d7n(“-b/2 + 0{WiWjWkn-(^-»)/^)]' 


(5) 


By Markov’s inequality and the fact that Wi, Wj and Wk are independent and 
have finite mean, it follows that WiWjWkn~^^~°‘^^'^ goes to 0 in probability when 
a < 1. Similarly, WiWjWkn~^^~^^^^‘^ —)■ 0 in probability. Hence, the quotient in l|5|l 
converges to 1 in probability for a < 1, as claimed. 

To prove part (b), note that, for a = 1, the lower bound ([5]) for converges 
in probability to (1 + (3'yWk)~^- To obtain an upper bound, we apply Lemma EH 
with a = 1 to get that 


Pn{Eij\EikEjk) < 


Pni^Eijk') ~\~ Pn{,Eij ik,jk) 

Pn{Eijk^ T Pn(,Eik,jk) PniEijkEikJk') 

1 + 0{WiWjWkn-^) 
l + Wk{/3-f + 0{WiWjWkn-^)y 


( 6 ) 


Here WiWjWkn~^ converges to 0 in probability by Markov’s inequaliy, and (b) 
follows. 

As for (c), combining the bound in Q with the estimates in Lemma l3 .1 1 yields 


Pn{Eij\EikEjk) < 


„(i-«)/2 + 0{WiWjWkn-y 
ri(i -«)/2 + Wk[y-r + 0{WiWjWkn-y]' 


Since WiWjWkU ^ ^ 0 in probability, this bound converges to 0 in probability for 
a > 1, as desired. □ 
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4 Clustering for a power law weight distribution 


When a = 1, the clustering is given by c{G) = E[(l + /37Wfc)“^]. Here we investigate 
this expression in more detail for the important case that E is a power law. More 
precisely, we take E to be a Pareto distribution with density 


fix) 


(A- 1 )^- 2 ® 


for X > 


A-2 

A-f 


When A > 2, this distribution has mean 1, as desired. The asymptotic clustering 
c{G) is given by the integral 


(A - 2)^-1 
(A - 1)^-2 



(1 + /3jx)~^x~^dx. 


Defining u := (A — 2)/(x • (A — 1)), we obtain 

1 (A-1)2 


c(G) = 


h (A - 2) y 
1 (A- 1 )^ 
/ 37 A (A - 2) 


u 




A- 1 
A-2 


-1 


du 


2 F 1 ( 1, A; 1 + A; —- 


1 /A-1 




where 2 F 1 is the hyper geometric function. For (d'y > (A—1) / (A—2), a series expansion 
of the integrand yields that 


c{G) 


1 (A- 1 )^ 

/37 (A-2) 

1 (A- 1 )^ 

/37 (A-2) 



1 

A: + A 


d> 



A-1 

A-2 



where is the Lerch transcedent. Furthermore, when A is an integer, we get 


c(G) = 


(A-2) 


A-1 


(A-1) 


A-2 


(-/37)^-iln(l + 


A-1 

/37(A - 2) 


A-1 

+ E 

£=1 


i-h) 


\-l-t 


A-1 

A-2 


Figure [T] (a) and (b) show how the clustering depends on A and fd'j respectively. 
For any c € (0,1) and a given tail exponent A, we can find a value of (d'y such that 
the clustering is equal to c. Combining this with a condition on induced by 
fixing the mean degree in the graph, the parameters /3 and 7 can be specified. 


5 Future work 

Apart from the degree distribution and the clustering, an important feature of real 
networks is that there is typically significant correlation for the degrees of neighbor¬ 
ing nodes, that is, either high (low) degree vertices tend to be connected to other 
vertices with high (low) degree (positive correlation), or high (low) degree vertices 
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tend to be connected to low (high) degree vertices (negative correlation). A next 
step is thus to quantify the degree correlations in the current model. The fact that 
individuals share groups should indeed induce positive degree correlation, which 
agrees with empirical observations from social networks; see Newman (2003) and 
Newman and Park (2003). 

Also other features of the model are worth investigating. For instance, many real 
networks are “small worlds”, meaning roughly that the distances between vertices 
remain small also in very large networks. It would be interesting to study the relation 
between the distances between vertices, the degree distribution and the clustering 
in the current model. 

Finally, dynamic processes behave differently on clustered networks as compared 
to more tree-like networks. Most work to date has focused on the latter class. 
In Britton et al. (2008) however, epidemics on random intersection graphs without 
random weights are studied and it is investigated how the epidemic spread is affected 
by the clustering in the graph. It would be interesting to extend this work to 
incorporate weights on the vertices, allowing to tune also the (tail of the) degree 
distribution and study its impact on the epidemic process. 
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(a) The clustering as a function of A for different values 
of Pr- h = 1 (—), ^7 = 5 (-), /37 = 10 (-). 



(b) The clustering as a function of for different values 
of A: A = 2.1 (—), A = 2.5 (-), A = 4 (-). 

Figure 1; Clustering for a power law distribution. 
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